Accurate `Tensor.device` for TFEager backends #1077

texasmichelle · 2020-09-11T04:23:01Z

This modifies Tensor.device for TFEager backends to retrieve device details from the underlying eager libraries. This is more accurate than returning Device.defaultTFEager and reflects the actual location of eager operations.

This was tested by building a CUDA toolchain and verifying that TFE_TensorHandleDeviceName() returns:

/job:localhost/replica:0/task:0/device:GPU:0

and a GPU device is returned by Tensor.device:

Device(kind: .GPU, ordinal: 0, backend: .TF_EAGER)

On a device without GPU, TFE_TensorHandleDeviceName() returns:

/job:localhost/replica:0/task:0/device:CPU:0

and a CPU device is returned by Tensor.device:

Device(kind: .CPU, ordinal: 0, backend: .TF_EAGER)
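For illustration, the mapping from the device name strings above to a kind and ordinal could look roughly like the sketch below. It reuses the regex pattern from the diff excerpt in this PR; `ParsedDevice` and `parseDeviceName` are hypothetical names and only stand in for the real `Device` type, which is not reproduced here.

```swift
import Foundation

// Hypothetical stand-in for the real Device type; only the kind and ordinal
// shown in the PR description are modeled.
struct ParsedDevice: Equatable {
    var kind: String   // e.g. "CPU" or "GPU"
    var ordinal: Int
}

// Extracts the device kind and ordinal from a fully qualified device name
// such as "/job:localhost/replica:0/task:0/device:GPU:0".
// Returns nil when the name does not match the expected syntax.
func parseDeviceName(_ name: String) -> ParsedDevice? {
    let pattern = ".+device:(.+):(\\d+)$"
    guard let regex = try? NSRegularExpression(pattern: pattern) else { return nil }
    let fullRange = NSRange(name.startIndex..., in: name)
    guard let match = regex.firstMatch(in: name, options: [], range: fullRange),
          let kindRange = Range(match.range(at: 1), in: name),
          let ordinalRange = Range(match.range(at: 2), in: name),
          let ordinal = Int(name[ordinalRange])
    else { return nil }
    return ParsedDevice(kind: String(name[kindRange]), ordinal: ordinal)
}
```

Returning an optional instead of trapping is a choice made for this sketch; what `Tensor.device` should do with an unexpected name is not settled by this example.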

Fixes tensorflow/swift#524.

pschuh · 2020-09-11T15:02:54Z

Sources/x10/swift_bindings/Device.swift

+      // Parse type and ordinal from a string with the expected syntax:
+      //   /job:localhost/replica:0/task:0/device:CPU:0
+      let pattern = ".+device:(.+):(\\d+)$"
+      let regex = try! NSRegularExpression(pattern: pattern)


This string parsing looks expensive. Might want to double check that this doesn't happen too much.
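One way to reduce the repeated cost, sketched here as an assumption and not as what the PR does, is to compile the regular expression once and reuse it for every call. `DeviceNameParser` and `kindAndOrdinal(of:)` are hypothetical names.

```swift
import Foundation

// Hypothetical namespace; not part of the PR.
enum DeviceNameParser {
    // Static stored properties are initialized lazily and only once, so the
    // pattern is compiled on first use and reused by all later calls.
    static let regex: NSRegularExpression = {
        // The pattern is a constant, so a failure here is a programmer error.
        try! NSRegularExpression(pattern: ".+device:(.+):(\\d+)$")
    }()

    // Returns the kind and ordinal parsed from `name`, or nil on a mismatch.
    static func kindAndOrdinal(of name: String) -> (kind: String, ordinal: Int)? {
        let fullRange = NSRange(name.startIndex..., in: name)
        guard let match = regex.firstMatch(in: name, options: [], range: fullRange),
              let kindRange = Range(match.range(at: 1), in: name),
              let ordinalRange = Range(match.range(at: 2), in: name),
              let ordinal = Int(name[ordinalRange])
        else { return nil }
        return (String(name[kindRange]), ordinal)
    }
}
```

This removes only the compilation step; each call still runs a match over the string, so it does not answer how often the parsing happens.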

pschuh · 2020-09-11T22:45:49Z

Sources/TensorFlow/Core/Tensor.swift

+
+            // Parse type and ordinal from a string with the expected syntax:
+            //   /job:localhost/replica:0/task:0/device:CPU:0
+            let pattern = ".+device:(.+):(\\d+)$"


Maybe break this String -> Device out as a separate function?
Also, I'm concerned that the string parsing will be expensive. This function is called a lot (whenever there is a scalar constant or anything like that). I think it would be best to see if you can add your own TFE_TensorHandleDevice_Type and TFE_TensorHandleDevice_Id. Alternatively, some benchmarking results might be enough.
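A rough sketch of what a separate String -> Device helper and a timing harness might look like follows. Everything in it is an assumption for illustration: `deviceKindAndOrdinal(fromDeviceName:)` and `secondsPerCall` are hypothetical names, the parser splits the string instead of using the regex, and it is not guaranteed to accept exactly the same inputs as the pattern in the diff (for example, it requires `device:` to be the last `/`-separated component).

```swift
import Foundation

// Hypothetical helper: parses "<prefix>/device:<KIND>:<ORDINAL>" without a regex.
func deviceKindAndOrdinal(fromDeviceName name: String) -> (kind: String, ordinal: Int)? {
    // The last path component is expected to look like "device:GPU:0".
    guard let component = name.split(separator: "/").last else { return nil }
    let fields = component.split(separator: ":", omittingEmptySubsequences: false)
    let asciiDigits: ClosedRange<Character> = "0"..."9"
    guard fields.count == 3,
          fields[0] == "device",
          !fields[1].isEmpty,
          !fields[2].isEmpty,
          fields[2].allSatisfy({ asciiDigits.contains($0) }),
          let ordinal = Int(fields[2])
    else { return nil }
    return (String(fields[1]), ordinal)
}

// Hypothetical micro-benchmark helper: average wall-clock seconds per call.
func secondsPerCall(iterations: Int, _ body: () -> Void) -> Double {
    let start = Date()
    for _ in 0..<iterations { body() }
    return Date().timeIntervalSince(start) / Double(iterations)
}
```

Timing this helper and the regex-based parsing over the two device names from the description, for example with `secondsPerCall(iterations: 100_000) { _ = deviceKindAndOrdinal(fromDeviceName: name) }`, would be one way to produce the benchmarking results mentioned above. No measurements are claimed here.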

texasmichelle · 2020-09-15T15:32:25Z

A different approach is coming. Closing this PR.

texasmichelle added 11 commits July 3, 2020 18:24

Remove rm statement

ebf0b87

Merge remote-tracking branch 'upstream/master'

6a91e58

Merge remote-tracking branch 'upstream/master'

b5de349

Merge remote-tracking branch 'upstream/master'

e3d1347

Merge remote-tracking branch 'upstream/master'

4b7ec30

Merge remote-tracking branch 'upstream/master'

4acc515

Merge remote-tracking branch 'upstream/master'

3500c5a

Merge remote-tracking branch 'upstream/master'

94e2c75

Merge remote-tracking branch 'upstream/master'

7a5aebc

Merge remote-tracking branch 'upstream/master' into eager_device

d49d344

Retrieve device from TFEager libraries

673182c

texasmichelle requested a review from pschuh September 11, 2020 04:23


texasmichelle changed the title ~~Eager device~~ Accurate Device.defaultTFEager Sep 11, 2020

pschuh approved these changes Sep 11, 2020

Move to Tensor.device

3547ab1

texasmichelle changed the title ~~Accurate Device.defaultTFEager~~ Accurate Tensor.device for TFEager backends Sep 11, 2020

pschuh reviewed Sep 11, 2020

texasmichelle closed this Sep 15, 2020

brettkoonce mentioned this pull request Sep 22, 2020

What's the expected output from Device.allDevices ? #1089

Closed
